A decomposition method with minimum communication amount for parallelization of multi-dimensional FFTs

نویسندگان

  • Truong Vinh Truong Duy
  • Taisuke Ozaki
چکیده

The fast Fourier transform (FFT) is undoubtedly an essential primitive that has been applied in various fields of science and engineering. In this paper, we present a decomposition method for parallelization of multi-dimensional FFTs with smallest communication amount for all ranges of the number of processes compared to previously proposed methods. This is achieved by two distinguishing features: adaptive decomposition and transpose order awareness. In the proposed method, the FFT data are decomposed based on a row-wise basis that maps the multi-dimensional data into one-dimensional data, and translates the corresponding coordinates from multi-dimensions into one-dimension so that the resultant one-dimensional data can be divided and allocated equally to the processes. As a result, differently from previous works that have the dimensions of decomposition pre-defined, our method can adaptively decompose the FFT data on the lowest possible dimensions depending on the number of processes. In addition, this row-wise decomposition provides plenty of alternatives in data transpose, and different transpose order results in different amount of communication. We identify the best transpose orders with smallest communication amounts for the 3-D, 4-D, and 5-D FFTs by analyzing all possible cases. Given both communication efficiency and scalability, our method is promising in development of highly efficient parallel packages for the FFT. ∗Corresponding author. Tel.: +81 761 51 1987 Email addresses: duytvt@{jaist.ac.jp,issp.u-tokyo.ac.jp} (Truong Vinh Truong Duy), [email protected] (Taisuke Ozaki) Preprint submitted to Computer Physics Communications February 26, 2013 ar X iv :1 30 2. 61 89 v1 [ cs .N A ] 6 F eb 2 01 3

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dfg-forschergruppe \spc" Fakultt at F Ur Mathematik Parallelization of Multi-grid Methods Based on Domain Decomposition Ideas

In the paper, the parallelization of multi-grid methods for solving second-order elliptic boundary value problems in two-dimensional domains is discussed. The parallelization strategy is based on a non-overlapping domain decomposition data structure such that the algorithm is well-suited for an implementation on a parallel machine with MIMD architecture. For getting an algorithm with a good par...

متن کامل

A New Compromise Decision-making Model based on TOPSIS and VIKOR for Solving Multi-objective Large-scale Programming Problems with a Block Angular Structure under Uncertainty

This paper proposes a compromise model, based on a new method, to solve the multi-objective large-scale linear programming (MOLSLP) problems with block angular structure involving fuzzy parameters. The problem involves fuzzy parameters in the objective functions and constraints. In this compromise programming method, two concepts are considered simultaneously. First of them is that the optimal ...

متن کامل

MPI- and CUDA- implementations of modal finite difference method for P-SV wave propagation modeling

Among different discretization approaches, Finite Difference Method (FDM) is widely used for acoustic and elastic full-wave form modeling. An inevitable deficit of the technique, however, is its sever requirement to computational resources. A promising solution is parallelization, where the problem is broken into several segments, and the calculations are distributed over different processors. ...

متن کامل

Empirical Mode Decomposition based Adaptive Filtering for Orthogonal Frequency Division Multiplexing Channel Estimation

This paper presents an empirical mode decomposition (EMD) based adaptive filter (AF) for channel estimation in OFDM system.  In this method, length of channel impulse response (CIR) is first approximated using Akaike information criterion (AIC). Then, CIR is estimated using adaptive filter with EMD decomposed IMF of the received OFDM symbol. The correlation and kurtosis measures are used to sel...

متن کامل

A new transpose split method for three-dimensional FFTs: performance on an Origin2000 and Alphaserver cluster

We discuss a new transpose split method for parallel computations of three-dimensional Fourier transforms. By splitting the data along two dimensions over the processors it allows for a higher degree of parallelization than the original transpose split method. The traditional transpose split method involves one alltoall communication. The new method involves two communication steps in which eac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computer Physics Communications

دوره 185  شماره 

صفحات  -

تاریخ انتشار 2014